The University of Amsterdam at WePS3
نویسندگان
چکیده
In this paper we describe our participation in the Third Web People Search (WePS3) evaluation campaign. We took part in the Online Reputation Management (ORM) task. Ambiguity of organization names (e.g., “Amazon” or “Apple”) raises obvious difficulties for systems that attempt to trace mentions of and opinions about a specific company in Web data, in an unsupervised manner. Problems are further amplified in the context of user generated content, where proper capitalization of named entities is often absent. The ORM task, introduced this year, addresses this very problem, by setting out the following challenge: given a set of Twitter entries containing an (ambiguous) company name and given the homepage of the company, discriminate entries that do not refer the company. Given the above definition, it is natural to formulate the problem as a binary classification task. Our focus was on building a general organization classifier that predicts, for each tweet, whether it is about a company. Our goal is to assess how a system without external aid from other sources (the company’s homepage, Wikipedia entry, etc.) can perform. We, therefore, focus on extracting features that are organization-independent and build on the characteristics of Twitter, such as noisy text, abbreviations and Twitter-specific language. Specifically, we trained a J48 decision tree classifier using the following groups of features: (i) company name (matching based on character 3-grams), (ii) content value (whether the tweet contains URLs, hashtags or is part of a conversation), (iii) content quality (ratio of punctuation and capital characters), (iv) organizational context (ratio of words found in tweets labelled as positive). We submitted a single run that performed around the median of all submitted systems. One interesting observation that requires further investigation is that our F-score for the negative class was substantially higher than for the positive class (0.55 vs. 0.36); for other teams it was usually the other way around. In future work we plan to build company-specific models by exploiting content both from Twitter and from external sources. Acknowledgements This research was supported by the Netherlands Organisation for Scientific Research (NWO) under project number 612.061.815 and partially by the Center for Creation, Content and Technology (CCCT).
منابع مشابه
Global Problem of Hospital Detention Practices
Although an official definition by the World Health Organization (WHO) or any other authority is currently lacking, hospital detention practices (HDP) can be described as: “refusing release of either living patients after medical discharge is clinically indicated or refusing release of bodies of deceased patients if families are unable to pay their hospital bills.” Reports of HDP are very scarc...
متن کاملOptimization of callus production and organogenesis of two commercial cultivars of Gladiolus (Gladiolus grandiflorus L. cv Amsterdam, Advance Red)
Gladiolus is cultivated as a high-value ornamental plant in the worldwide and it has the highest cultivation area and the highest level of underground organs imports in Iran. In this study, in order to investigate indirect regeneration of Amsterdam and Advance Red cultivars an experiment was conducted in a completely randomized design with three replications. For callogenesis, bud sprout (apica...
متن کاملSocial Accountability in Maternal Health Services in the Far-Western Development Region in Nepal: An Exploratory Study
Background Social accountability or citizen-led accountability has been promoted in many low- and middle-income countries to improve the quality, access to and use of maternal health services. Experiences with social accountability in maternal health services in Nepal have not yet been documented. This study identifies existing social accountability structures and activities in maternal h...
متن کاملRadiographic Predictors for Short-term Functional Outcome after Radial Head Arthroplasty in Patients with Persistent Symptoms after Treatment for Radial Head
Background: Evaluation of the accurate position after radial head arthroplasty remains a challenge for surgeons.Standard radiographs are used to evaluate the position of the implant, however, results regarding radiographicdeficiencies on clinical outcome are not consistent. In this retrospective study our main aim was to determine if subtleradiographic deficiencies after radia...
متن کاملIncreased Osteogenic Potential of Pre-Osteoblasts on Three-Dimensional Printed Scaffolds Compared to Porous Scaffolds for Bone Regeneration
Background: One of the main challenges with conventional scaffold fabrication methods is the inability to control scaffold architecture. Recently, scaffolds with controlled shape and architecture have been fabricated using 3D-printing. Herein, we aimed to determine whether the much tighter control of microstructure of 3DP PLGA/β-TCP scaffolds is more effective in promoting osteogenesis than por...
متن کامل